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INTRODUCTION 


In  the  region  analysis  work  done  by  Brice  and  Fennema  (1969),  a  one- 

pass  algorithm  was  used  to  initially  partition  the  picture  into  named, 

homogeneous,  connected  components  and  to  set  up  the  region  structure 

(AROS).  This . one-pass  algorithm  is  rather  complex  in  nature,  and  several 

alternatives  have  been  suggested,  one  of  which  (a  two-pass  algorithm)  we 

have  chosen  to  use  for  comparison  purposes  because  of  its  simplicity  and 

* 

apparent  efficiency. 

Both  algorithms  use  an  L-shaped  window  to  compare  the  elements  of 
the  picture  array.  If  two  points  differ  in  properties  (only  horizontal 
and  vertical  comparisons  are  made),  a  pair  of  elementary  vectors  are 
placed  to  separate  them  (see  Figure  1). 


The  curves  made  up  of  these  elementary  vectors  partition  the  picture 
into  regions  and  serve  as  the  boundaries  of  these  regions.  The  two  al¬ 
gorithms  discussed  here  differ  in  the  way  they  label  the  picture  elements 
and  the  boundaries.  Section  II  is  a  description  of  the  one-pass  algorithm, 
Section  III  explains  the  two-pass  algorithm,  and  Section  IV  is  a  com¬ 
parison  of  their  performance. 

* 

This  work  was  sponsored  by  the  Advanced  Research  Projects  Agency  and  the 
Rome  Air  Development  Center  under  Contract  F30602-69-C-0056  and  by  the 
Advanced  Research  Projects  Agency  and  the  National  Aeronautics  and  Space 
Administration  under  Contract  NAS12-2221. 
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II  THE  ONE-PASS  ALGORITHM 


A.  Background 

The  structure  on  which  both  of  these  algorithms  operate  is 
made  up  of  two  arrays:  a  picture  array  T1  and  a  component  array  T2. 
Each  word  of  the  picture  array  is  two  picture  elements,  each  of  which 
is  divided  into  the  bit  fields  of  Figure  2.  Here  S  is  a  special  bit 

S  P  V 


Figure  2 

to  denote  curve  headers;  P  is  a  field  that  contains  the  grayscale  of 
(i,j)  when  S  =  0  or  a  pointer  to  a  T2  word  when  S  =  1.  V  contains  the 
vector  code  of  the  elementary  boundary  vectors  that  exist  near  the 
point  (i,j)  (see  Figure  3). 


Figure  3 


The  component  array  T2  is  an  array  of  words  (called  component 
words)  having  the  format  of  Figure  4.  One  of  these  words  is  assigned  to  each 


R  B  B  G 

_ P  v _ 


Figure  4 
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curve  header  (picture  element  with  S  =  1).  The  R  field  contains  the  name 

(in  code  form)  of  the  region  to  which  the  point  belongs:  the  D  and  B 

P  v 

contain  the  point  and  vector  that  begin  the  boundary  component  tagged  by 

this  word.  (B  is,  of  course,  the  pointer  to  the  picture  element  that 
P 

points  to  this  word. )  Lastly,  G  is  the  grayscale  of  the  point  B  . 

P 

The  boundary  of  the  region  Rl  is  then  extracted  by  scanning  T2 
for  all  the  words  with  Code  Rl  in  their  R  fields,  going  to  the  beginning 
vector  (point)  and  following  the  boundary  using  the  SUCCESSOR  function, 
which  goes  from  a  vector  VI  to  the  next  leftmost  vector  V2,  as  in  Figure  5. 

V2 

A 

VI 

- ► 

T 

Figure  5 

B.  Some  Notions 

To  simplify  the  presentation  of  the  algorithm,  we  shall  make 
some  conventions  in  terminology. 

First  we  will  always  by  convention  denote  the  X  field  of  a 
word  (or  half  word)  Y  by  X(Y).  For  example,  V(A)  is  the  V  field  of  the 
point  A.  G(T2(P(PT)))  is  the  G  field  of  the  element  of  T2  pointed  at 
by  the  P  field  of  PT,  and  thus  it  contains  the  grayscale  of  PT.  Next, 
suppose  PT  =  (i,j)  is  a  picture  point  (half  word)  when  we  define  the 
grayscale  function 
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P (PT)  if  S (PT)  =  0 


GS(PT)  =  < 

(g(T2(P(PT)))  if  S (PT) 

Because  an  elementary  vector  is  defined  by  a  point  PT  and  a  vector  code, 
v,  we  will  denote  this  vector  (PT,v).  Lastly,  remembering  that  SUCCESSOR 
(SUCC)  is  the  name  of  the  function  used  in  following  the  boundary  and 
PREDECESSOR  (PRED)  is  its  inverse,  we  define  3  functions: 


R  (PT,X) 
P 


(R(T2 (P(PT) ))  if  S(PT)  =  1 


Jr  ((pred(pt,x)))  if  s(pt)  =  o 

v  p 


!R (T2 (P (PT) ) )  if  S (PT)  =  1 
R  ((SUCC  (PT,X) ) )  if  S (PT )  =  0 

S 


!R  (PT,8)  if  (AND  PT,  8)  =  1 
p 

RG  (LFT(PT) )  otherwise 

where  LFT(PT)  is  the  point  to  the  left  of  PT . 

C.  The  Algorithm 

The  algorithm  is  best  explained  by  the  flowchart  of  Figure  6 
with  the  following  notes:  Box  A  (and  V(B),8),  is  a  check  for  the 
presence  of  bit  8  in  the  V  field  of  B,  which  means,  since  GS(A)  r  GS(B), 
that  the  configuration  in  Figure  7  has  been  encountered. 

This  means  (as  far  as  we  know  at  this  point)  that  B  is  a  point 
of  a  new  region. 

Box  B  tests  if  the  predecessor  of  (A,l)  is  (D, 2).  If  so,  we 
have  the  configuration  of  Figure  8,  meaning  that  we  have  encountered  a 
new  curve  for  the  boundary  of  the  region  containing  A. 


Figure  6 
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D 
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t 

Figure  7 


T 

Figure  8 


The  other  tests  are  of  a  similar  nature  and  deal  with  the  con¬ 
nection  of  two  partial  curves  (boxes  C  and  D)  and  the  merging  of  two 
partial,  expanded  regions  (box  D) .  This  procedure  is  executed  in  a 
raster-like  fashion  until  the  entire  picture  has  been  processed. 


Ill  THE  TWO- PASS  ALGORITHM 


The  basic  data  structure  remains  the  same  as  for  the  one-pass 
algorithm,  but  a  bit  table  of  four  bits  per  picture  point  is  added  for 
marking  purposes,  one  for  each  possible  vector.  During  the  first  pass 
the  separating  vectors  are  created,  and  during  the  second  pass  the 
regions  and  components  are  named. 

A.  First  Pass 

The  first  pass  is  merely  to  create  and  store  in  the  array,  in 
the  proper  field,  the  elementary  vectors  of  each  boundary,  using  the 
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same  L-shaped  window  as  the  one-pass  algorithm,  but  without  naming 
regions  and  components.  During  this  pass  each  picture  point  is  visited 
twice. 

B.  Second  Pass 

The  algorithm  proceeds  from  left  to  right,  following  a  line 
until  it  finds  an  unmarked  vector.  (The  bit  table  is  used  to  mark  each 
possible  separating  vector.)  When  an  unmarked  vector  is  found,  the  al¬ 
gorithm  marks  it,  creates  a  special  pointer  that  indicates  the  beginning 
of  a  new  component,  and  makes  a  new  entry  in  T2  similar  to  that  used  in 
the  one-pass  algorithm.  Then,  starting  from  this  vector,  the  boundary 
is  followed  and  every  vector  is  marked  until  a  marked  vector  is  found 
(obviously  the  starting  vector).  When  following  the  boundary,  the  al¬ 
gorithm  sets  up  a  counter  to  which  -1  is  added  when  a  left  turn  occurs 
and  +1  is  added  when  a  right  turn  occurs.  At  the  end  of  the  following 
procedure,  if  the  result  of  the  counter  is  negative,  the  boundary  is 
counterclockwise-oriented;  it  is  clockwise-oriented  if  the  result  is 
positive.  A  clockwise  boundary  means  that  there  is  a  hole  in  one  region, 
and  therefore  there  is  a  region  with  more  than  one  component  (see  Figure  9). 

If  the  result  of  the  counter  at  this  end  of  the  following  pro¬ 
cedure  is  positive,  the  name  of  the  region  to  which  this  component  be¬ 
longs  is  found  by  stepping  to  the  left  along  one  line  until  a  marked 
vector  is  found  and  by  following  this  boundary  to  its  beginning.  Other¬ 
wise,  a  new  region  name  is  created  and  stored  in  T2,  and  pointers  are 
set  up  in  the  same  manner  as  for  the  one-pass  algorithm.  During  this 
second  pass  every  point  is  visited  at  least  once. 

IV  COMPARISON  OF  THE  TWO  ALGORITHMS 

The  two-pass  algorithm  has  the  obvious  advantage  of  simplicity, 
and  the  one-pass  one  may  be  faster  in  execution  time,  but  it  seems 
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pretty  difficult  to  estimate  what  would  be  the  running  time  of  the  two 
algorithms  without  coding  and  executing  them.  In  order  to  get  a  rough 
approximation  of  these  times,  we  decided  to  simulate  the  action  of  both 
algorithms  on  an  illustrative  picture  that  includes  most  of  the  problems 
met  in  a  typical  partition  (see  Figure  9). 

A.  Common  Operations 

The  basic  operations  involved  in  both  algorithms  are:  visiting 
each  picture  element,  comparing  its  value  with  the  value  of  the  adjacent 
element  in  the  L-shaped  window,  and  putting  in  the  elementary  separating 
vector  if  needed.  These  operations  are  the  same  for  both  algorithms  and 
therefore  have  no  influence  on  the  comparison. 

B.  Differences 

Besides  these  common  operations,  the  one-pass  algorithm  involves 
mostly  the  following  operations: 

Follow:  Using  the  SUCCESSOR  or  PREDECESSOR  function,  the 

program,  during  this  procedure,  follows  the  boundary  until 
a  special  pointer  is  encountered.  The  cost  of  this  opera¬ 
tion  is  measured  in  the  number  of  elementary  vectors 
(arrows).  The  basic  operation,  consisting  of  passing 
from  one  vector  to  its  successor,  has  been  estimated  as 
30  ps.  (A  sample  program  has  been  written,  ) 

Arrow  test:  During  the  course  of  the  execution,  the  program 
has  to  test  for  a  certain  arrow  configuration.  The  basic 
instruction  is  to  skip  on  mask  bits.  The  cost  has  been 
evaluated  to  5  ps. 

In  the  same  condition,  the  two-pass  algorithm  involves,  be¬ 
sides  the  operations  already  mentioned,  the  following  operations: 
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Follow  (with  orientation):  The  boundary  is  followed,  and 


a  counter  is  altered  by  -1  or  +1,  depending  on  whether  the 
boundary  turns  to  the  right  or  to  the  left.  The  cost  of 
this  operation  is  a  little  larger  than  the  usual  follow; 
it  has  been  estimated  to  be  36  ps. 

Marking:  While  the  above  operation  is  performed,  each 

unmarked  vector  is  marked.  The  basic  operation  includes 
a  test  and  an  access  to  the  bit  table;  its  cost  has  been 
estimated  to  be  10  ps. 

Scanning:  During  the  second  pass,  the  program  scans  the 

picture  array,  looking  for  an  unmarked  vector.  The  cost 
of  the  basic  scan  from  one  picture  element  to  another  has 
been  estimated  to  be  10  ps. 

With  these  definitions  and  estimations,  the  performance  of  the 
two  algorithms  for  the  same  picture  (Figure  9)  is  compared  on  Table  I. 

V  CONCLUSION 

As  far  as  our  estimations  are  correct,  it  seems  that  the  one-pass 
algorithm  works  twice  as  fast  as  the  two-pass  algorithm.  However,  given 
the  same  picture,  the  two-pass  algorithm  depends  only  on  the  number  of 
curves,  but  the  one-pass  algorithm  depends  on  the  nature  of  the  curves. 
The  worst  case  for  the  one-pass  algorithm  would  be  the  case  where  there 
are  a  lot  of  45°  lines  (with  respect  to  the  horizontal).  This,  however, 
is  not  a  problem  in  a  real  environment. 

A  decision  has  been  made  to  implement  the  one-pass  algorithm  for 
two  reasons:  It  seems  faster,  and  it  consumes  less  space  since  it  needs 
no  bit  table. 
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Table  I 


Operations 

Time 

(ns) 

A 

One-Pass 

B 

Two- Pass 

Difference 

A-B 

Follow 

117  X  30  =  3510 

38  X  30  =  1140 

2370 

Arrow  test 

200  X  5  =  1000 

0 

1000 

Follow  (with  orientation) 

36 

0 

112  X  36  =  4032 

-4032 

Marking 

10 

0 

112  X  10  =  1120 

-1120 

Scanning 

10 

0 

200  X  10  =  2000 

-2000 

TOTAL  (p.s) 

4510 

8292 

-3782 

